Mining clusters and corresponding interpretable descriptions - a three-stage approach

نویسندگان

  • Mario Drobics
  • Ulrich Bodenhofer
  • Werner Winiwarter
چکیده

This paper presents a three-stage approach to data mining which puts special emphasis on the visualization and interpretability of the results. In the first stage, the input data is represented by a selforganizing map in order to allow visualization and to reduce the amount of data while removing noise, outliers, and missing values. Then this preprocessed information is used to identify and display fuzzy clusters of similarity. Finally, descriptions close to natural language are computed for these clusters in order to provide the analyst with qualitative information. This is accomplished by generating fuzzy rules using an inductive learning method. The proposed approach is applied to three case studies, including image data and real-world data sets. The results illustrate the robustness, intuitiveness and wide applicability of the method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Applying a decision support system for accident analysis by using data mining approach: A case study on one of the Iranian manufactures

Uncertain and stochastic states have been always taken into consideration in the fields of risk management and accident, like other fields of industrial engineering, and have made decision making difficult and complicated for managers in corrective action selection and control measure approach. In this research, huge data sets of the accidents of a manufacturing and industrial unit have been st...

متن کامل

A clustering approach for mineral potential mapping: A deposit-scale porphyry copper exploration targeting

This work describes a knowledge-guided clustering approach for mineral potential mapping (MPM), by which the optimum number of clusters is derived form a knowledge-driven methodology through a concentration-area (C-A) multifractal analysis. To implement the proposed approach, a case study at the North Narbaghi region in the Saveh, Markazi province of Iran, was investigated to discover porphyry ...

متن کامل

Retaining Customers Using Clustering and Association Rules in Insurance Industry: A Case Study

This study clusters customers and finds the characteristics of different groups in a life insurance company in order to find a way for prediction of customer behavior based on payment. The approach is to use clustering and association rules based on CRISP-DM methodology in data mining. The researcher could classify customers of each policy in three different clusters, using association rules. A...

متن کامل

Description-oriented community detection using exhaustive subgroup discovery

Communities can intuitively be defined as subsets of nodes of a graph with a dense structure in the corresponding subgraph. However, for mining such communities usually only structural aspects are taken into account. Typically, no concise nor easily interpretable community description is provided. For tackling this issue, this paper focuses on description-oriented community detection using subg...

متن کامل

Data Mining Using Synergies Between Self-Organizing Maps and Inductive Learning of Fuzzy Rules

Identifying structures in large data sets raises a number of problems. On the one hand, many methods cannot be applied to larger data sets, while, on the other hand, the results are often hard to interpret. We address these problems by a novel three-stage approach. First, we compute a small representation of the input data using a self-organizing map. This reduces the amount of data and allows ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Expert Systems

دوره 19  شماره 

صفحات  -

تاریخ انتشار 2002